§0.  Introduction

Many procedures have by now been proposed for testing the hypothesis
that a distribution is normal. Recently Wilk and Shapiro made an
extensive comparative study of several of the tests proposed so far,
together with their own [5], [6]. A procedure which was not covered by
Wilk and Shapiro's study was also given by Kempthorne [4].

In this paper, we shall propose still another one, which is
related to the problem of estimation of the location parameter.
Suppose that $X_1, \ldots, X_n$ are distributed independently according to
some continuous distribution. Then

(0.1)  $E\{\bar X \mid X_2 - X_1, \ldots, X_n - X_1\} = 0$  a.e.

if and only if $X_1, \ldots, X_n$ are normally distributed (Kagan, Linnik
and Rao [2]). Therefore if we can somehow estimate the regression
function of $\bar X$ on the sample differences, we may have a testing
procedure for normality. Since regression in general is difficult
to deal with, we can consider instead a linear regression function

(0.2)  $\bar X = \mu + \alpha_1 (X_{(2)} - X_{(1)}) + \cdots + \alpha_{n-1} (X_{(n)} - X_{(1)}) + U$

where $X_{(i)}$ denotes the i-th order statistic.

Then normality implies $\alpha_1 = \cdots = \alpha_{n-1} = 0$. Conversely, if
$\alpha_1 = \cdots = \alpha_{n-1} = 0$ for all n, then $\bar X$ is the best
linear unbiased estimator, which is asymptotically efficient under
mild regularity conditions (Chernoff, Gastwirth and Johns [1]), but
$\bar X$ can be efficient only in the case of the normal distribution (see
Kagan [3]).


The regression analysis of (0.2) is easily done when we have some
replications of sets of order statistics of the same size, and is
discussed in §1 and §2. But since this procedure may not have much
power, a more sophisticated method is proposed in §3, and its
properties based on Monte Carlo experiments are discussed in §4.

The  Monte  Carlo  results  indicate  that  the  power  of  our  test 
is  almost  as  high  as  any  other  method  thus  far  known  against 
several  alternatives,  and  sometimes  much  higher  than  others. 

The method proposed in §3 is closely related to and actually
derived from the author's method of constructing uniformly efficient
robust estimators of location [7].


§1.  Exact  Test  Based  on  Replications  of  Order  Statistics 

Suppose that we have m independent sets of order statistics
of size k, $X_{i(1)} < \cdots < X_{i(k)}$, $i = 1, \ldots, m$, from a continuous
distribution with density f(x). We want to test the hypothesis of
normality, i.e. the hypothesis that

(1.1)  $f(x) = \dfrac{1}{\sqrt{2\pi}\,\sigma} \exp\left\{ -\dfrac{(x-\mu)^2}{2\sigma^2} \right\}$

with unknown $\mu$ and $\sigma$, against the alternative of a non-normal
distribution.

Define

$\bar X_i = \dfrac{1}{k} \sum_{j=1}^{k} X_{i(j)}$,

$Y_{ij} = X_{i(j+1)} - X_{i(j)}$,   $j = 1, \ldots, k-1$,   $i = 1, \ldots, m$

and consider the linear regression model

(1.2)  $\bar X_i = \mu + \alpha_1 Y_{i1} + \cdots + \alpha_{k-1} Y_{i(k-1)} + U_i$,   $i = 1, \ldots, m$.

It should be remarked that if

$f(x) = \dfrac{1}{\sigma} f_0\left( \dfrac{x-\mu}{\sigma} \right)$   for some $f_0$,

then $\alpha_1, \ldots, \alpha_{k-1}$ in (1.2) are determined by $f_0$ and are independent of
the location and scale parameters $\mu$ and $\sigma$. And under the hypothesis
of normality,

(1.3)  $\alpha_1 = \alpha_2 = \cdots = \alpha_{k-1} = 0$

and (1.3) holds for all k only for the case of normality.

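The test of the hypothesis (1.3) can be easily obtained in the
usual way as follows. Let $\hat\mu, \hat\alpha_1, \ldots, \hat\alpha_{k-1}$ be the least
squares estimators, and let

(1.4)  $Q_e = \sum_i (\bar X_i - \hat\mu - \hat\alpha_1 Y_{i1} - \cdots - \hat\alpha_{k-1} Y_{i(k-1)})^2$   and   $Q_\alpha = \sum_i (\bar X_i - \bar X)^2 - Q_e$

where $\bar X = \dfrac{1}{m} \sum_i \bar X_i$.

Then under the hypothesis $Q_e/\sigma^2$ and $Q_\alpha/\sigma^2$ are distributed
independently according to chi-square distributions with degrees of
freedom m-k and k-1 respectively (here $\sigma^2$ is to be read as the
variance of $\bar X_i$). Hence, we can reject the hypothesis if

(1.5)  $\dfrac{Q_\alpha/(k-1)}{Q_e/(m-k)} > F_\alpha(k-1, m-k)$

where $F_\alpha$ denotes the upper percentile of an F distribution.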

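The quantity $Q_e$ can also be expressed in the following way:

$Q_e = \min_{\mu, \alpha} \sum_i (\bar X_i - \mu - \alpha_1 Y_{i1} - \cdots - \alpha_{k-1} Y_{i(k-1)})^2$

$\quad = \min_{\mu;\ \beta_1 + \cdots + \beta_k = 1} \sum_i (\beta_1 X_{i(1)} + \beta_2 X_{i(2)} + \cdots + \beta_k X_{i(k)} - \mu)^2$

(1.6)  $\quad = \min_{\beta_1 + \cdots + \beta_k = 1} \sum_i \{ \beta_1 (X_{i(1)} - \bar X_{(1)}) + \cdots + \beta_k (X_{i(k)} - \bar X_{(k)}) \}^2$

where $\bar X_{(j)} = \dfrac{1}{m} \sum_i X_{i(j)}$.

Hence if we define $s_{jl} = \sum_i (X_{i(j)} - \bar X_{(j)})(X_{i(l)} - \bar X_{(l)})/m$,
$S = \{s_{jl}\}$ and $S^{-1} = \{s^{jl}\}$, the $\beta$'s minimizing (1.6) are given by

$\beta_j = \dfrac{\sum_l s^{jl}}{\sum_j \sum_l s^{jl}}$

and

$Q_e = \dfrac{m}{1' S^{-1} 1}$   where $1' = (1, \ldots, 1)$.

Also

$\sum_i (\bar X_i - \bar X)^2 = m (1' S 1)/k^2$.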

Consequently the test criterion (1.5) is equivalent to

(1.7)  $\dfrac{k^2}{(1'S1)(1'S^{-1}1)} < \dfrac{1}{1 + \frac{k-1}{m-k} F_\alpha(k-1, m-k)}$.

In other words, under the hypothesis the left side of (1.7) is
distributed according to a beta distribution with parameters equal
to (m-k)/2 and (k-1)/2.
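The computation in (1.4) to (1.7) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: NumPy is assumed to be available, the function name `replication_f_statistic` and its return layout are hypothetical, and the input is taken to be an m-by-k array whose rows are the m sets of size k.

```python
import numpy as np

def replication_f_statistic(sets):
    """F statistic of (1.5) from m replicated sets of size k (rows of `sets`)."""
    x = np.sort(np.asarray(sets, dtype=float), axis=1)  # order statistics per set
    m, k = x.shape
    xbar = x.mean(axis=1)                                # set means
    total = np.sum((xbar - xbar.mean()) ** 2)            # total sum of squares
    S = np.cov(x, rowvar=False, bias=True)               # moment matrix, divisor m
    one = np.ones(k)
    q_e = m / (one @ np.linalg.solve(S, one))            # Q_e = m / (1' S^-1 1)
    q_alpha = total - q_e                                # Q_alpha as in (1.4)
    f_stat = (q_alpha / (k - 1)) / (q_e / (m - k))
    return f_stat, (k - 1, m - k), q_e, q_alpha
```

The returned value is to be compared with the upper percentile of the F distribution with the returned degrees of freedom. The matrix form of $Q_e$ is used instead of an explicit least squares fit; the two agree because every choice of the regression coefficients corresponds to a weight vector summing to one.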

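When we consider only symmetric alternatives, that is if we
assume that

$f(x) = f(-x)$,

then the regression function (1.2) can be expressed as

(1.8)  $\bar X_i = \mu + \gamma_1 Z_{i1} + \cdots + \gamma_f Z_{if} + U_i$

where

$Z_{ij} = Y_{ij} - Y_{i(k-j)} = X_{i(j+1)} + X_{i(k-j)} - X_{i(j)} - X_{i(k-j+1)}$,   $j = 1, 2, \ldots, \left[\frac{k-1}{2}\right]$,

and the sum of squares for the error is given by

(1.9)  $\tilde Q_e = \sum_i (\bar X_i - \hat\mu - \hat\gamma_1 Z_{i1} - \cdots - \hat\gamma_f Z_{if})^2 = \dfrac{m}{1' \tilde S^{-1} 1}$

where $\tilde S$ is a square matrix of order $\left[\frac{k+1}{2}\right]$ whose elements are equal
to $(s_{ij} + s_{k-i+1,k-j+1} + s_{i,k-j+1} + s_{j,k-i+1})/4$,  $i, j = 1, \ldots, \left[\frac{k+1}{2}\right]$.
The hypothesis is rejected when

(1.10)  $\dfrac{\left(\sum_i (\bar X_i - \bar X)^2 - \tilde Q_e\right)/f}{\tilde Q_e/(m-f-1)} > F_\alpha(f, m-f-1)$

where $f = \left[\frac{k-1}{2}\right]$.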
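The folding of the moment matrix used in the symmetric case can be sketched as follows. This is an illustrative Python fragment, assuming NumPy; the names `fold_symmetric` and `symmetric_error_ss` are hypothetical, and the folded matrix is taken to be of order [(k+1)/2], which is the order for which the matrix formula reproduces the least squares fit on the symmetrized differences.

```python
import numpy as np

def fold_symmetric(S):
    """Average S over the reflection j -> k+1-j and keep the leading block."""
    k = S.shape[0]
    h = (k + 1) // 2
    folded = (S + S[::-1, ::-1] + S[:, ::-1] + S[::-1, :]) / 4.0
    return folded[:h, :h]

def symmetric_error_ss(sets):
    """Error sum of squares of the symmetric model, m / (1' S~^-1 1)."""
    x = np.sort(np.asarray(sets, dtype=float), axis=1)
    m = x.shape[0]
    S = np.cov(x, rowvar=False, bias=True)   # moment matrix with divisor m
    St = fold_symmetric(S)
    one = np.ones(St.shape[0])
    return m / (one @ np.linalg.solve(St, one))
```

Subtracting this quantity from the total sum of squares of the set means gives the numerator of the symmetric test, with [(k-1)/2] degrees of freedom.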

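Combining the above two we can obtain the following table of
the analysis of variance.

                              Table 1

Factor           Sum of squares                                  Degrees of freedom
Non-Normality    $\sum_i (\bar X_i - \bar X)^2 - \tilde Q_e$     $\left[\frac{k-1}{2}\right]$
Asymmetry        $\tilde Q_e - Q_e$                              $k - \left[\frac{k+1}{2}\right]$
Error            $Q_e$                                           $m - k$
Total            $\sum_i (\bar X_i - \bar X)^2$                  $m - 1$

Here $Q_e$ is the error sum of squares of the model (1.2) and $\tilde Q_e$ that of
the symmetric model (1.8).

If we have more specific alternatives in mind, we can further
decompose the sum of squares for Non-normality into several
components.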

A numerical example: Following are 8 sets of 4 ordered
random numbers from an exponential population:

    .178     .281     .528    1.463
    .589     .621     .842    2.896
    .687     .906    1.831    1.906
    .387     .655     .943    2.966
    .027     .281    1.341    1.661
    .205    1.081    1.311    2.142
    .526     .836    2.049    4.110
    .064     .244     .477     .520
From this the moment matrix of the order statistics is given
(it is symmetric; only the upper triangle is shown):

          .4371    .3428    .5118   1.1926
mS =               .7135    .8769   1.2897
                           1.5124   2.6197
                                    8.4548

and, inverting it,

$Q_e = \dfrac{m}{1' S^{-1} 1} = .331$

And

$\sum_i (\bar X_i - \bar X)^2 = m (1'S1)/k^2 = 1.550$

hence

$Q_\alpha = 1.550 - .331 = 1.219$

$F = \dfrac{Q_\alpha/3}{Q_e/4} = \dfrac{.406}{.083} = 4.89$

which is significant at the 10% level but not at the 5% level. If
we take symmetry into consideration,

$m \tilde S = \begin{pmatrix} 2.819 & 1.191 \\ 1.191 & .995 \end{pmatrix}$,

$m (1' \tilde S^{-1} 1)^{-1} = 1/(1.093 - .089) = .996$, from which we have
the following table of the analysis of variance.

                       Table 2

Factor           S.S.     d.f.     V       F
Non-normality    .554      1      .554    6.67
Asymmetry        .665      2      .333    4.01
Error            .331      4      .083
Total           1.550      7

Neither factor is significant at the 5% level; the
non-normality (given symmetry) factor is significant at the 10% level
and the asymmetry factor is almost significant at the 10% level.
§2.  Some Discussions on the Power of Tests

The exact power of the tests given in (1.5) or (1.10) is
difficult to compute, but the approximate values can be obtained.

From the discussion of the previous section it is easily seen
that $\hat\mu$ in (1.2) is equal to the best linear unbiased estimator
of the location parameter based on the order statistics of size k. And
it is known that for fairly regular distributions the BLUE is distributed
according to an approximately normal distribution except for very
small k. Also we can assume that the $\bar X_i$ are approximately normal,
if the underlying distribution has the finite variance $\sigma^2$. Therefore
we can assume that $Q_e$ and $Q_\alpha$ in (1.4) are distributed approximately
as a multiple of independent chi-square distributions, with

(2.1)  $E(Q_e) = (m-k)\, \sigma^2_{BLUE}$

       $E(Q_\alpha) = (m-1)\, \sigma^2/k - (m-k)\, \sigma^2_{BLUE}$

From the above relation we can compute the approximate power
of the test against fairly regular alternatives.

For example, suppose that k = 5 and m = 9, and consider the
alternative of the double exponential distribution, with density

$f(x) = \frac{1}{2} e^{-|x|}$.

In this case $\sigma^2 = 2$ and $\sigma^2_{BLUE}$ for k = 5 is calculated to be
.317. Hence

$E(Q_e) = 4 \times .317 = 1.268$

$E(Q_\alpha) = 8 \times .400 - 1.268 = 1.932$

If we take the level of the test to be 5%, then the
hypothesis is rejected when

$\dfrac{Q_\alpha/4}{Q_e/4} = \dfrac{Q_\alpha}{Q_e} > 6.39 = F_{0.05}(4, 4)$.

Hence the power under the double exponential alternative is
approximately

$\Pr\left\{ \dfrac{1.932}{1.268} \times F > 6.39 \right\} = \Pr\{F > 4.19\} = .098$

that is, the power would be approximately 10%.

If symmetry is assumed, then we have in a similar way,

$E(\tilde Q_e) = 6 \times .317 = 1.902$

$E(\tilde Q_\alpha) = 8 \times .400 - 1.902 = 1.298$

Now the test criterion is

$\dfrac{\tilde Q_\alpha/2}{\tilde Q_e/6} > 5.14$

The power is

$\Pr\left\{ \dfrac{1.298/2}{1.902/6} \times F > 5.14 \right\} = \Pr\{F > 2.54\} = .15$.

Now the power is approximately 15%.
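The approximate power calculation of this section can be reproduced with a short Python sketch. It is only an illustration: the function names are hypothetical, and the tail probability of the F distribution is evaluated by a closed form (a binomial sum) that holds when both degrees of freedom are even, which covers the two cases (4, 4) and (2, 6) used here.

```python
from math import comb

def f_survival_even(x, d1, d2):
    """P(F > x) for F with (d1, d2) degrees of freedom, both even."""
    if d1 % 2 or d2 % 2:
        raise ValueError("this closed form needs even degrees of freedom")
    a, b = d2 // 2, d1 // 2
    y = d2 / (d2 + d1 * x)            # F > x  <=>  Beta(a, b) variable < y
    n = a + b - 1
    # regularized incomplete beta function for integer a, b as a binomial tail
    return sum(comb(n, j) * y**j * (1 - y) ** (n - j) for j in range(a, n + 1))

def approx_power(e_q_alpha, e_q_e, df_num, df_den, critical_value):
    """Power when the F ratio is inflated by the ratio of expected mean squares."""
    scale = (e_q_alpha / df_num) / (e_q_e / df_den)
    return f_survival_even(critical_value / scale, df_num, df_den)

power_general = approx_power(1.932, 1.268, 4, 4, 6.39)    # roughly 0.10
power_symmetric = approx_power(1.298, 1.902, 2, 6, 5.14)  # roughly 0.16
```

With the expected sums of squares of the double exponential example, the sketch gives a power of about one tenth for the general test and somewhat above 15% for the symmetric one.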

A power of 15% may not be quite impressive for
$\alpha$ = 5%, but it is not too low compared with the
results of the experiments in Shapiro and Wilk [6]. Their test,
which is shown to be better than most of the other "usual" tests, has
...% power for n = 45. (Judging from the graph given
in their paper.)

From the above discussion it is obvious that if the normal
approximation under the alternative is sufficiently accurate, then
the test is unbiased. Also, if for some k and for an alternative

(2.2)  $\sigma^2_{BLUE} < \sigma^2/k$

then for fixed k, $Q_e/(m-k) \to \sigma^2_{BLUE}$ and $Q_\alpha/(k-1) \to \infty$ as m increases,
hence the test is consistent against those alternatives which
satisfy (2.2).


§3.  A Test Based on Conditional Variance-Covariance Matrix

The test procedures proposed in §1 have the disadvantage
that we need many replications of sets of order statistics
of the same size. Hence if we are given a sample of size n = mk,
it is necessary to divide the sample randomly into m sets of the
same size. Such a randomization procedure is intuitively undesirable,
and may result in a loss of power of the test.

But if we look at (1.7) we can see that the essential point
in our procedure is to obtain an estimator of the variance-covariance
matrix of order statistics of size k. As was already discussed
in another paper of the author [7], it can be obtained in the
following way.

Let $X_{(1)} < \cdots < X_{(n)}$ be the order statistics of size n (> k).
Consider a fictitious random subsample of size k, and let the
order statistics obtained from this subsample be

$Y_{(1)} < \cdots < Y_{(k)}$.

Let $T_\alpha$ be the conditional expectation of $Y_{(\alpha)}$ given
$X_{(1)} < \cdots < X_{(n)}$, which is expressed as

(3.1)  $T_\alpha = E(Y_{(\alpha)} \mid O_n) = \sum_i p_{\alpha i} X_{(i)}$.

Then

$E(T_\alpha) = E(E\{Y_{(\alpha)} \mid O_n\}) = \mu_{\alpha|k}$

where $\mu_{\alpha|k}$ denotes the expectation of the $\alpha$-th order statistic of
the sample of size k.


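Also if we take the conditional expectation of the product
of the order statistics,

(3.2)  $T_{\alpha\beta} = E(Y_{(\alpha)} Y_{(\beta)} \mid O_n) = \sum_i \sum_j p_{\alpha\beta}^{ij} X_{(i)} X_{(j)}$.

The expectation of $T_{\alpha\beta}$ is equal to

$\sigma_{\alpha\beta|k} + \mu_{\alpha|k}\, \mu_{\beta|k}$

where $\sigma_{\alpha\beta|k}$ denotes the covariance of the order statistics.

Hence if we define

(3.3)  $s_{\alpha\beta} = T_{\alpha\beta} - T_\alpha T_\beta$

then $E(s_{\alpha\beta})$ is approximately equal to $\sigma_{\alpha\beta|k}$ when n is large.
Hence we can put $S = \{s_{\alpha\beta}\}$ as an estimator of the
variance-covariance matrix of the order statistics.

The coefficients in (3.1) and (3.2) are already given in [7]:

$p_{\alpha i} = {}_{i-1}C_{\alpha-1}\; {}_{n-i}C_{k-\alpha} \,/\, {}_nC_k$   for $\alpha \le i \le n-k+\alpha$

(3.4)  $p_{\alpha\beta}^{ij} = {}_{i-1}C_{\alpha-1}\; {}_{j-i-1}C_{\beta-\alpha-1}\; {}_{n-j}C_{k-\beta} \,/\, {}_nC_k$   for $\alpha \le i$, $j \le n-k+\beta$, $\alpha < \beta$, $i < j$

$p_{\alpha\alpha}^{ii} = p_{\alpha i}$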


And analogously to (1.7) we may obtain a test criterion by

(3.5)  $P = \dfrac{k^2}{(1'S1)(1'S^{-1}1)}$.

Since

$1'S1 = V\left( \sum_\alpha Y_{(\alpha)} \,\Big|\, O_n \right) = \dfrac{k(n-k)}{n} \cdot \dfrac{\sum_i (X_i - \bar X)^2}{n-1}$,

$P$ and $\sum_i (X_i - \bar X)^2$ are independent when the underlying distribution
is normal, and $P$ is never greater than 1.
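A direct, unoptimized way to evaluate the statistic of this section is sketched below in Python. It is an illustration only: NumPy is assumed, the names `subsample_cov` and `p_statistic` are hypothetical, and the probabilities are the subsample selection probabilities stated above, written with `math.comb`.

```python
from math import comb
import numpy as np

def subsample_cov(x, k):
    """Conditional covariance matrix S of the order statistics of a random
    subsample of size k, given the ordered sample x."""
    xs = np.sort(np.asarray(x, dtype=float))
    n = len(xs)
    total = comb(n, k)
    T = np.zeros(k)          # conditional first moments
    M = np.zeros((k, k))     # conditional second moments
    for a in range(1, k + 1):
        for i in range(a, n - k + a + 1):
            p = comb(i - 1, a - 1) * comb(n - i, k - a) / total
            T[a - 1] += p * xs[i - 1]
            M[a - 1, a - 1] += p * xs[i - 1] ** 2
        for b in range(a + 1, k + 1):
            for i in range(a, n - k + a + 1):
                for j in range(i + b - a, n - k + b + 1):
                    p = (comb(i - 1, a - 1) * comb(j - i - 1, b - a - 1)
                         * comb(n - j, k - b) / total)
                    M[a - 1, b - 1] += p * xs[i - 1] * xs[j - 1]
            M[b - 1, a - 1] = M[a - 1, b - 1]
    return M - np.outer(T, T)

def p_statistic(x, k):
    """Test statistic k^2 / ((1'S1)(1'S^-1 1)); small values suggest non-normality."""
    S = subsample_cov(x, k)
    one = np.ones(k)
    return k**2 / ((one @ S @ one) * (one @ np.linalg.solve(S, one)))
```

The sum of all elements of the returned matrix equals k(n-k)/n times the usual sample variance, which gives a convenient check of an implementation, and the statistic is unchanged by shifting or rescaling the data.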

Although the exact distribution of $P$ under the normal distribution
is difficult to obtain, it would be expected that the distribution
would be approximately equal to a beta distribution.

Also we can compute the symmetric case by defining $\tilde S$ from S
in exactly the same way as was done in §1, and we can define the test
statistic by

(3.6)  $\tilde P = \dfrac{k^2}{(1'S1)(1'\tilde S^{-1}1)}$.
§4.  Some Monte Carlo Results


To investigate the distribution of the statistic $P$, we
performed a Monte Carlo experiment.

For each of n = 10, 15 and 20, N = 2000 samples are drawn
from the normal and some non-normal distributions (which will
be described below). Empirical cumulative distributions and sample
moments were computed.

Table 3 presents the sample moments of $P$ up to the fourth
order under the normal hypothesis.

As was expected, the distributions are fitted by beta distributions
very well, and the parameters of the beta distribution with
density

(4.1)  $\dfrac{x^{(f_1/2)-1} (1-x)^{(f_2/2)-1}}{B\left(\frac{f_1}{2}, \frac{f_2}{2}\right)}$

are estimated by fitting the first two moments.
These values are given in Table 4.
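The moment fit can be illustrated with a small Python sketch. The function names are hypothetical, and the only inputs are the first two sample moments of the statistic as tabulated in Table 3; the second function is the product of n-1 and the first moment, which is the quantity tabulated in the last column of Table 4.

```python
def fit_beta_df(m1, m2):
    """Degrees of freedom (f1, f2) of the beta density (4.1) matching the
    first two moments m1 = E(P) and m2 = E(P^2)."""
    var = m2 - m1**2
    total = m1 * (1.0 - m1) / var - 1.0   # sum of the two beta parameters
    return 2.0 * m1 * total, 2.0 * (1.0 - m1) * total

def chi_square_df(n, m1):
    """Degrees of freedom (n - 1) * E(P) of the rougher approximation."""
    return (n - 1) * m1

f1, f2 = fit_beta_df(0.7335, 0.5806)   # n = 10, k = 5 row of Table 3
phi = chi_square_df(10, 0.7335)
```

For the n = 10, k = 5 row this gives values close to the tabulated 5.27, 1.91 and 6.60. The fit is sensitive to rounding of the moments when the mean is close to 1.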
A little more roughly, we can expect that the quantity

$P \times \sum_i (X_i - \bar X)^2$

be distributed approximately according to a $\chi^2$-distribution
(taking $\sigma^2 = 1$), with degrees of freedom given by

(4.2)  $\varphi = E\left( P \times \sum_i (X_i - \bar X)^2 \right) = (n-1) E(P)$.

If this approximation is accurate enough, we can approximate
the distribution of $P$ by a beta distribution with parameters $\varphi/2$
Table 3.  Sample moments of $P$

   n     k     E(P)     E(P^2)    E(P^3)    E(P^4)

  10     3    .9034     .8304     .7720     .7237
         4    .8451     .7410     .6656     .6081
         5    .7335     .5806     .4807     .4101
         6    .6446     .4661     .3610     .2928
         7    .4906     .2969     .2017     .1475
         8    .3647     .1850     .1125     .0765

  15     3    .9398     .8892     .8455     .8074
         4    .9005     .8235     .7620     .7116
         5    .8439     .7332     .6501     .5852
         6    .7925     .6555     .5584     .4860
         7    .7291     .5665     .4591     .3832
         8    .6678     .4853     .3727     .2974

  20     3    .9576     .9200     .8862     .8557
         4    .9288     .8695     .8194     .7762
         5    .8921     .8074     .7389     .6822
         6    .8565     .7492     .6659     .5993
         7    .8166     .6871     .5912     .5175
         8    .7764     .6272     .5215     .4434
Table 4.  Degrees of freedom

   n     k      f_1      f_2      $\varphi$

  10     3    10.22       ?      8.13
         4     6.57     1.20     7.61
         5     5.27     1.91     6.60
         6       ?      2.51     5.80
         7     3.35     3.46     4.42
         8     2.55     4.40     3.28

  15     3    15.85     1.02    13.16
         4    10.55     1.14    12.61
         5     8.90     1.64    11.82
         6     7.96     2.08    11.10
         7     6.80     2.52    10.21
         8     6.18     3.08     9.35

  20     3    24.01     1.06    18.20
         4    16.47     1.26    17.65
         5    13.03     1.57    16.95
         6    11.70     1.96    16.27
         7    10.44     2.34    15.52
         8     9.50     2.73    14.75
Table 5.  Approximate critical points

$\alpha$ = 0.01

   n \ k     3       4       5       6       7       8
    10     0.44    0.35    0.16    0.11    0.04    0.02
    15     0.61    0.52    0.39    0.31    0.23    0.18
    20     0.74    0.65    0.54    0.48    0.40    0.35

$\alpha$ = 0.05

   n \ k     3       4       5       6       7       8
    10     0.67    0.51    0.33    0.24    0.11    0.06
    15     0.80    0.66    0.53    0.46    0.37    0.30
    20     0.85    0.75    0.66    0.60    0.53    0.47

$\alpha$ = 0.10

   n \ k     3       4       5       6       7       8
    10     0.75    0.60    0.43    0.33    0.18    0.09
    15     0.85    0.73    0.63    0.55    0.45    0.38
    20     0.89    0.81    0.74    0.67    0.61    0.54
and $(n-\varphi-1)/2$, or that of

$\dfrac{(1-P)/(n-\varphi-1)}{P/\varphi}$

by an F distribution with degrees of freedom $n-\varphi-1$ and $\varphi$.

The values of $\varphi$ fitted by the first sample moments are also
given in Table 4. Also approximate percentage points are given in
Table 5.

It is checked that the above two approximations fit closely
to the higher order moments obtained from the sample.
And when n-k is not small, it is seen that

(4.3)  $\varphi \approx n - \dfrac{k+1}{2}$.

The  power  of  the  test  is  also  computed,  and  is  given  in  Table  6 
under  the  following  alternatives: 

Contaminated Normal (CN) with the density

$f(x) = 0.90 \times \dfrac{1}{\sqrt{2\pi}} e^{-x^2/2} + 0.10 \times \dfrac{1}{\sqrt{18\pi}} e^{-x^2/18}$

Logistic (Lo)

$f(x) = e^{-x}/(1 + e^{-x})^2$

Double Exponential (DE)

$f(x) = \frac{1}{2} e^{-|x|}$
Special Distribution (Sp): the random variable X is given by

^  =  072  ^^'°*^  -  (i-u)"°''^) 


where U is a uniformly distributed random variable.

t distribution with d.f. 2 (t2)

$f(x) = \mathrm{const} \times (1 + x^2/2)^{-3/2}$

Cauchy (C)

$f(x) = \dfrac{1}{\pi (1 + x^2)}$

Rectangular (R)

$f(x) = 1$,    $0 < x < 1$
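For readers who wish to repeat such an experiment, the alternatives listed above, except for the special distribution (Sp), can be sampled as in the following Python sketch. It assumes NumPy's random `Generator`; the function name `draw` and the short labels are hypothetical and simply follow the abbreviations of this section.

```python
import numpy as np

def draw(name, n, rng):
    """Draw n observations from one of the alternatives (Sp is omitted)."""
    if name == "N":
        return rng.standard_normal(n)
    if name == "CN":
        # 90% N(0, 1) mixed with 10% N(0, 9), matching the density given above
        scale = np.where(rng.random(n) < 0.10, 3.0, 1.0)
        return scale * rng.standard_normal(n)
    if name == "Lo":
        return rng.logistic(size=n)
    if name == "DE":
        return rng.laplace(size=n)
    if name == "t2":
        return rng.standard_t(2, size=n)
    if name == "C":
        return rng.standard_cauchy(n)
    if name == "R":
        return rng.random(n)
    raise ValueError(f"unknown alternative: {name}")
```

Since the test statistic is free of location and scale, the particular standardization of each alternative does not affect the rejection frequencies.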


Since the frequency distributions were tabulated with fixed
intervals (.01), we could not get exact points corresponding to the
level of the test. Those points which give the nearest values to
the level for the normal case were chosen and are given in Table 5.
And frequencies of the rejection of the hypothesis among N = 2000
samples are tabulated in Table 6.

Also powers corresponding to $\alpha$ = 0.10, with some of the results
quoted from Shapiro and Wilk [6], are given in Table 7 (power in
percentages), where $K_3$ and $K_5$ denote our test for k = 3 and k = 5
respectively; and W = Wilk and Shapiro's, KS = Kolmogorov-Smirnov's,
WCM = Weighted Cramér-von Mises', $\chi^2$ = chi-square tests,
$b_2$ = test based on the 4th order moment.

Our  test  is  shown  to  be  almost  uniformly  more  powerful  than 
any  other  tests,  and  it  is  remarkable  that  ours  has  a  power  similar 
Table 6a.  Frequencies of the rejection of the normal
hypothesis among N = 2000 cases

n=10

$\alpha$ = 0.01

  k=       3       4       5       6       7       8
  N       20      18      20      17      21      16
  CN     157     133      70      50      42      40
  Lo      74      71      33      30      36      30
  DE     171     164      94      75      65      60
  Sp     123     119      60      45      46      37
  t2     404     397     278     236     152     150
  C      957     959     853     798     659     588
  R        1      20      23      41      56      40

$\alpha$ = 0.05

  k=       3       4       5       6       7       8
  N       95     102      98     103     103     106
  CN     325     278     247     211     179     171
  Lo     210     174     170     154     119     133
  DE     405     332     316     269     215     222
  Sp     310     265     240     217     159     158
  t2     670     599     559     522     400     356
  C     1209    1147    1188    1124     984     916
  R       37     195     194     230     153     107
Table 6b

n=15

$\alpha$ = 0.01

  k=       3       4       5       6       7       8
  N       18      20      20      21      21      22
  CN     222     246     177     144     113      91
  Lo      67      65      61      53      41      32
  DE     229     223     187     142     120      94
  Sp     170     164     125      99      80      73
  t2     625     618     593     521     446     391
  C     1300    1313    1358    1312    1256    1213
  R        0      48      83     117     108     115

$\alpha$ = 0.05

  k=       3       4       5       6       7       8
  N      103      98      96     102      98      96
  CN     440     375     334     328     292     250
  Lo     257     192     161     146     130     119
  DE     600     465     411     390     344     281
  Sp     426     335     283     264     235     196
  t2     948     842     820     794     744     695
  C     1595    1511    1533    1538    1494    1458
  R      193     335     351     426     407     372


Table 6c

n=20

$\alpha$ = 0.01

  k=       3       4       5       6       7       8
  N       20      19      20      20      21      19
  CN     38?       ?     301     277     237     195
  Lo     109     112      86      83      74      61
  DE     382     369     326     297     269     227
  Sp     264     255     215     196     161     131
  t2     917     905     851     817     777     728
  C     1668    1669    1666    1661    1634    1611
  R       10     238     291     373     382     372

$\alpha$ = 0.05

  k=       3       4       5       6       7       8
  N       95      95      97      95      97     100
  CN     575     527     499     486     444     405
  Lo     298     235     224     218     185     156
  DE     699     600     578     554     507     459
  Sp     521     437     410     391     361     325
  t2    1198    1117    1096    1078    1036     989
  C     1788    1757    1765    1766    1761    1745
  R      392     601     651     724     720     731


Table 7.  Comparison of the power of tests
(power in percentages), $\alpha$ = 0.10

   n    Test    K_3    K_5     W     KS    WCM   $\chi^2$   b_2

  10    N        10     10     10    10     10     10       10
        Lo       16     14     12     9     13     14       12
        DE       27     23     25    13     36     14       18
        Sp       24     18     13     9      9     10       10
        t2       39     36     41    15     63     15       38
        C        65     65     42    30     95     23       56
        R        10     17     15    14     14     14       20

  15    N        10     10     10    10     10     10       10
        Lo       18     15     14     6      8     12       13
        DE       37     31     27     7     20     17       27
        Sp       27     23     13     7      7     13        8
        t2       53     49     51    18     69     18       53
        C        83     82     81    47     98     46       82
        R        25     32     28    11      8     17       22

  20    N        10     10     10    10     10     10       10
        Lo       21     18     16     9      9      8       13
        DE       43     39     32    13     37     25       35
        Sp       33     29     12     8     13      9        9
        t2       66     63     58    23     81     54       58
        C        92     91     92    65     99     22       88
        R        37     48     39    19     14     18       38
to Wilk and Shapiro's, but almost always slightly higher than theirs.

Table  6  shows  that  except  for  the  case  of  the  rectangular 
distribution,  tests  with  smaller  k  values  have  larger  power. 
§5.  Concluding Remarks

The Monte Carlo study indicates that our test seems to have
fairly high power, at least against symmetric alternatives. It also
has the advantage of being free of location and scale parameters.

The test may have small power against distributions for
which the sample mean has a high efficiency as an estimator of
location. But this may not be a serious disadvantage if the main
purpose of detecting non-normality is to protect ourselves
against serious loss or misuse of information caused by careless
application of procedures based on the normality assumption, for
most of the problems raised in applied statistical research are
about location parameters (including regression parameters).

The null distribution of our test statistic is yet to be
investigated more carefully. The beta approximation using two
parameters $f_1$ and $f_2$ seems to be accurate enough, but the rougher
approximation (4.2) with $\varphi = n - (k+1)/2$ is not, giving 7 or
8% probability of error under the hypothesis corresponding
to the nominal 5% level. And the values of $f_1$ and $f_2$ or, equivalently,
the moments of $P$ are difficult to calculate analytically;
even approximate empirical formulae have not been found yet.